Sums and averages 

O ' Stephen Semmes 

. Rice University 

' Abstract 

These informal notes are concerned with sums and averages in various 
situations in analysis. 
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1 Real and complex numbers 

Let R be the field of real numbers. The absolute value of a real number x is 
denoted |x| and defined to be equal to x when x > and to —x when x < 0. 
Thus |a;| > 0, 

(1.1) \x + y\ < \x\ + \y\, 
and 

(1.2) \xy\ = \x\\y\ 
for every x, y £ R. 

If A is a nonempty set of real numbers with an upper bound, then there is a 
unique real number which is the least upper bound or supremum of A, denoted 
sup A. Similarly, a nonempty set A of real numbers with a lower bound has 
a greatest lower bound or infimum, denoted inf A. It is sometimes convenient 
to put sup A — +oo when A has no upper bound, or inf A = — oo when A has 
no lower bound. Normally we shall only be concerned with infima of sets of 
nonnegative real numbers here, which are also nonnegative. 

Let C be the field of complex numbers. A complex number z can be ex- 
pressed as x + y i, where i,j/eR and i 2 — — 1. In this and y are known 
as the real and imaginary parts of z, and are denoted Rez, Imz, respectively. 
The complex conjugate z of z is defined by 

(1.3) z = x — yi. 
In particular, z = z and 

(1.4) Rez = , lmz = . 

Observe that 

(1.5) z + w = z + w 
and 

(1.6) z~w=~zw 

for every z,w E C. The modulus of z = x + y i is the nonnegative real number 
given by 

(1.7) \z\ = (x 2 +y 2 y/ 2 . 
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Equivalently, 

(1.8) \z\ 2 = zz, 
and hence 

(1.9) |zio| = |,z|H 

for every z, w G C. 

Of course, | Rez|, | Imz < \z\ for every zeC. If z,w G C, then 



(1.10) |z + w| 2 = (z + w)(z+W) 

= |z| 2 + z w + ~z w + \i 
= \z\ 2 + 2 Rez w + \w 



(1.11) 

since zw — zw. This implies that 

(1.12) |z + w| 2 < \z\ 2 + 2\z\ \w\ + \w\ 2 = (\z\ + \w\) 2 , 
and therefore 

(1.13) \z + w\ < \z\ + \w\. 



2 Cesaro means 

As usual, a sequence {zj}JL of complex numbers converges to z G C if for 
every e > there is a nonnegative integer L such that 

(2.1) \zj-z\<e 

for every j > L. In this case, one can show that the sequence of averages 

ZlH l-Zn 



(2.2) Cr, 



n+ 1 



also converges to z as n — > oo. However, there are also sequences {zj}°^ 
of complex numbers that do not converge, but for which the corresponding 
sequence {( n }^Lo of averages does converge. For example, if zj = (— 1) J , then 
( n = when n is odd and ( n = l/(n + 1) when n is even, and lim n _>oo = 0. 
If z is any complex number, then 

n 

(2.3) (z- 1)JV = zn+1 ~ !. 

where z J = 1 for every z G C when j = 0. Hence 



(2-4) E 



z - 1 
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when z^l. It follows that 

1 

(2.5) lim -V 



n->oo n + - 

3=0 

when \z\ = 1 and z ^ 1. This extends the case of z = —1 described in the 
previous paragraph. 

Let Y^jLo a j De arL infinite series of complex numbers, and consider the 
sequence of partial sums 

l 

(2.6) = 

j=o 

By definition, a 3 converges if {bi}f^ converges, in which event 



(2.7) E^ = /i 



lim 



. oo 

J=0 



The average /3„ of 6o, . . . , b n is also given by 

(2.8) ^ = £^±Iz2« 3 



a ; . 



The series Xjlo a i * s sa ^ to ^ e Cesaro summable if {/J„}$£L converges. 
Consider the case of a geometric series 

oo 

(2.9) £V, 

where a G C and a 3 = 1 when j = again. If |a| < 1, then lirn 3 _ ) . 00 a J = 0, and 
the corresponding geometric series converges with 

oo 

(2.10) E« j = T^ 

by the previous computations. If \a\ > 1, then |a J | = |ap > 1 for every j, 
and the geometric series does not converge in the conventional sense. It is 
Cesaro summable with sum 1/(1 — a) when \a\ = 1 and a 1, by the earlier 
computations for the partial sums applied twice to estimate their averages too. 



3 Admissible series 

Let a j De an mnn it e series of complex numbers. If 

oo 

(3.i) J2 a > tj 

3=0 
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converges for some real number t < 1, then 
(3.2) lim a,j t j = 0, 

j->oc 

which implies that {a^t?}?^ is a bounded sequence. Conversely, if {djP}°^ 
is bounded, then 

oo 

(3.3) 

j=o 

converges for < r < t, by comparison with a convergent geometric series. 

Let us say that an infinite series Yl'jLo a J °^ com pl cx numbers is admissible 
if any of the previous conditions holds for every positive real number r < 1 or 
t < 1, as appropriate, so that each of the other conditions also holds for every 
0<r<lorO<t<l. This is the same as saying that 

oo 

(3.4) f(z) = J2"jZ j 

3=0 

has radius of convergence greater than or equal to 1. If fo; = X^ 7= o a j are the 



partial sums of X)jlo a i' tnen Xi*lo ^ i s admissible too. Indeed, aj = 0(R 
implies that bi = 0(R l ) for each R > 1. 
Put 6_i =0, so that 

n n n n 

(3.5) y. «> zj = - zj = E b i zj - E ■ zj - 

3=0 3=0 3=0 j=0 



■3) 



ri-1 



Since 

n n 

(3.6) E^ - E''< • : E^ 

.7=0 j = l j=0 

we get that 

n n— 1 n— 1 

(3.7) ^ a, z j = b 3 ( zj - zj+1 ) + b n z n = (1 - *>) E ^ z ' + 6 " z "' 

3=0 j=0 3=0 

If z < 1, then admissibility of Yl^Lo bn i m P nes that lim^oo b n z n — 0, and 

oo 

(3.8) f(z) = (l-z)J2bjz j - 

3=0 

4 Abel summability 

If a 3 converges, then it is well known that 



(4.i) P !™_X> r 



i — >-i- 

3=0 
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exists and is equal to ^ fLo a i • ^ n admissible series J^JLo a j °f complex num- 
bers is said to be Abel summable when this limit exists. 

For example, J2JLo 0,3 IS admissible for any complex number a with \a\ = 1, 
and 

oo 1 

(4.2) \^ a j z j = - 

^ 1 - az 

j=o 

for every z G C with \z\ < 1, which implies that Y^fLo 0,3 * s Abel summable to 
1/(1 — a) when Similarly, YlJLoij + 1) aJ 1S admissible when \a\ — 1, and 

oo ^ 

(4.3) E(i + 1 )«^ J '= (13^ 

for \z\ < 1, so that J2°°- j a? is Abel summable to 1/(1 — a) 2 when a ^ 1. 
Let {&j}°^ be a sequence of complex numbers, and consider 



Uj. 

3=0 



1 " 

(4.4) j> 

If {^n}^Lo converges, then 

(4.5) _JL^ n _ 1 = _L.yS 
vy n+l n+l^- 



n-l 



j=0 

converges to the same value. This implies that 

(4.6) _^ =/3n _ o 
n + 1 n + 1 

as n — > oo. If a i * s a Cesaro summable series of complex numbers, then 

one cam apply this to b n — Xj=o a j to S et triat 

(4.7) _^n_ = bn-b^ ^ 

v ' n+ 1 n+1 

as n — > oo. In particular, X^loO + 1 ) a3 ^ s not Ccsaro summable for any a e C 
with |a| = 1. 

If Yl'jLo a i * s Cesaro summable, then X^=o °J ^ s an admissible series, since 
a„ = 0(n). It is well known that J2'jLo a j ^ s Abel summable in this case, and 

with the same sum. For if 6; = X^=o a o are tne partial sums and c„ = X/Lo ^ 
are their partial sums, then 

oo oo oo 

(4.8) a > zj = - z ) E 5 ^ = (! - z ) 2 E c i 

J=0 j=0 j=0 
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when | z \ < 1, as in the preceding section. Equivalently, 

oo oo 

(4.9) 5>^ = (l-*) 2 £0' + 

3=0 3=0 

when \z\ < 1, where /?j = Cj/(j + 1) is the average of bo, ■ ■ ■ , bj, as before. Of 
course, 

(4-10) E0' + 1 )^ = 7T3T)2 

is the derivative of the usual geometric series. If the /3j's were constant, then 
the desired conclusion would follow immediately. If 0j — > as j — > oo, then one 
can check that X}j=o a i * s Abel summable with sum equal to 0. If X^_ Oj is 
Cesaro summable, so that {/3j}^L converges, then one can combine these two 
cases to show that X jlo a i 1S Abel summable with the same sum. 



5 Cauchy products 

If YlJLo a j an d Xz*lo h are infinite series of complex numbers, then their Cauchy 
product is the infinite series X^Lo ° n wnose terms are given by 

n 

(5.1) c n =y^dj b n -j. 

3=0 

Formally, 

CO oo oo 

(5.2) E c » = (E o ')(5>)' 

n=0 j=0 1=0 

and in particular this holds when aj = bi = for all but finitely many j and I. 
Moreover, (5.2) holds when a,j = for all but finitely many j or 6; = for all 
but finitely many I. Note that X^Lo c ™ z " * s tne Cauchy product of Y^jLo a j ^ 
and Xto ^ z ' ^ or ever y z e C. 

If X j!Lo a i ano - Xi*lo converge absolutely, which means that Xj*lo l a i 
and Xto 1^1 converge, then X^Lo c ™ converges absolutely, and (5.2) holds. In 
connection with this, observe that 

n 

(5.3) \c n \<Y,Wj\\bn-j\, 

3=0 

where the sum on the right side of the inequality corresponds exactly to the 
Cauchy product of Xjlo a i ano - Eto 1^1- One can also show that X^Lo c " 
converges and satisfies (5.2) when Xjlo a 3 ano - Xz*lo ^ converge and at least 
one of these two series converges absolutely. 

If aj,bj = 0(R J ) for some R > 0, then c n — 0(nR n ). This implies that 
X^L c ™ ^ s a dmissible when Xjlo a i ano - Xz^o ^ are a dmissible. In this case, 
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the corresponding power series J2jLo a j z " '> Eto h z l , and X)^Lo c ™ z ™ conver g e 
absolutely for every z G C with \z\ < 1, and satisfy 

oo oo oo 

(5.4) £ Cn z«=(]Ta^) (]>>/). 

n=0 j=0 Z=0 

If Y^JLo a j an( i Eto hi are Abel summable, then J2^=o ° n 1S Abel summable, 
and their Abel sums satisfy (5.2). This follows from the analogous statement for 
the corresponding power series on the unit disk, as in the previous paragraph. 



6 Norms on vector spaces 

Let V be a real or complex vector space. A real- valued function N on V is said 
to be a norm if N(v) > for every v G V, N(v) = if and only if v = 0, 

(6.1) N(tv) = \t\N{v) 

for every v £ V and real or complex number t, as appropriate, and 

(6.2) N(v + w) < N(v) + N{w) 

for every v, w G V. Thus the absolute value and modulus determine norms on 
R and C as one-dimensional vector spaces, respectively. 

Let n be a positive integer, and consider the spaces R" and C" of n-tuplcs 
of real and complex numbers. As usual, these are vector spaces with respect to 
coordinatewise addition and multiplication. Consider 

(6.3) Hi = M + -" + h»| 

and 

(6.4) |H|oo = max(|t>i|, . . . , \v n \) 

for v = (vi, . . . ,v n ) in R n or C™. It is easy to see that ||u||i and ||w||oo are norms 
on R™ and C". 

The standard Euclidean norm on R™ and C" is defined by 

(6.5) IHl2 = (hl 2 + -" + KI 2 ) 1/2 . 

This clearly satisfies the positivity and homogeneity requirements of a norm. 
The triangle inequality will be discussed in the next two sections. 
Observe that 

(6.6) IMIco < IMIi 
and 

(6-7) N|oo<N|2 

for every v in R™ or C™. One can also check that 

(6-8) IM| 2 <IMIi 

for every v in R™ or C n , using the first inequality. More precisely, 

(6.9) Hi < Hi IHloo < IHI?, 

where the first step follows directly from the definitions. 
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7 Inner product spaces 

Let V be a real or complex vector space again. An inner product on V is a 
real or complex- valued function (v, w) defined for v,w G V with the following 
properties. First, (v, w) is linear as a function of v for each fixed w G V. Second, 

(7.1) (w,v) = {v,w) 

for every v, w G V in the real case, and 



(7.2) (w,v) = {v,w) 

for every v,w G V in the complex case. This implies that (u, to) is linear in to 
in the real case, and that it is conjugate-linear in the complex case. This also 
implies that (v,v) G R for every v G V in the complex case. The third condition 
asks that 

(7.3) (v,v)> 

for every v £ V with w ^ 0. Of course, (0, 0) = by linearity. 

For each v G V, let ||v|| be the nonnegative real number defined by 

(7.4) \\v\\ = (v,v)^ 2 . 
The Cauchy-Schwarz inequality states that 

(7.5) \(v, W )\<\\v\\\\w\\ 

for every v, to G V. To show this, one can start with 

(7.6) (v + tw,v + tw) > 

for every t G R or C, as appropriate. This implies that 

(7.7) 2|t||M|<|M| 2 + |t| 2 H| 2 , 

by expanding the inner product and collecting terms, and choosing the sign of t 
in the real case or t/\t\ in the complex case to get the absolute value or modulus 
of the inner product on the left. The Cauchy-Schwarz inequality follows by 
taking \t\ = |M|/||w|| when v, to ^ 0. 

Using the Cauchy-Schwarz inequality, one gets that 

(7.8) ||w + w|| 2 = (v + to, v + to) 

< |h;|| 2 + 2||t;||||t i ,|| + || W || 2 = (||t;|| + || U ;||) 2 . 

Hence 

(7.9) \\v + w\\ < \\v\\ + \\w\\ 

for every u,to G V. Thus ||w|| is a norm on V, since the positivity and homo- 
geneity conditions are clearly satisfied. 
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The standard inner products on R n , C n are given by 

n 

(7.10) (v,w)=J2 v J w 3 

3 = 1 

for v = (vi, . . . , v„), w — (wi, . . . , w n ) € R™, and 

n 

(7.11) {v,w)=^2vjwj 

3 = 1 

in the complex case. The corresponding norm 
(7-12) IMI = (£N 2 ) 

is the same as the standard Euclidean norm ||u||2- 

8 Convexity 

A set E in a real or complex vector space V is said to be convex if 

(8.1) tv + (1 -t)w e E 

for every v,w G S and real number t such that < t < 1. For example, if iV is 
a norm on V, then the closed unit ball 

(8.2) B = {v e V : N(v) < 1} 
is convex. 

Conversely, if TV is a real-valued function on V which satisfies the positivity 
and homogeneity conditions of a norm, and if the corresponding closed unit 
ball B is convex, then one can show that N satisfies the triangle inequality and 
hence is a norm. For if v, w are nonzero vectors in V, then we can apply (8.1) 
with v = v/N(v), w = w/N(w), 

(8.3) *= N ^ 



N(v) + N(w) ' 
and E = B to get that 

(8 ' 4) N$)Vn(w) EB > 
which says exactly that 

(8.5) N(v + w) < N(v) +N{w), 

as desired. 
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A real- valued function <j> on the real line is convex if 



(8.6) 



<f)(tx+(l-t)y) <t<f>{x) + {l-t)<j>{y) 



for every x, y, t £ R with < t < 1. For example, 



(8.7) 



<f>(x) = \x\ 



p 



is convex when p > 1 . 

If p > 1 and v £ R" or C™, then put 



(8-8) IMIp=(EH P ) 



This satishes the positivity and homogeneity requirements of a norm, and one 
can use the convexity of 4>(x) = \x\ p to show that the corresponding unit ball is 
convex, and hence that \\v\\ p is a norm. 

9 A few estimates 

Observe that 

(9.i) Hloo<IHI P 

for every v £ R™, C and p > 1. If 1 < p < q < oo, then 



(9-2) \\v\\«<\\v\\ p p \\v\\^<\\vr p 




(9.4) ||v|| P <n 1/p ||f 



for every v £ R™ or C" and p > 1 . 
If 1 < P < q < oo, then 



(9.5) 



for every v £ R™ or C n . Equivalcntly, 



(9.6) 




More precisely, this is the same as 



(9.7) 
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which can be derived from the convexity of <f>(x) = \x\ q / p on the real line. 
If N is any norm on R" or C™, then 

(9.8) JV(")<^Nl2 

for some A > and every v € R™ or C™, as appropriate. This can be verified 
using the triangle inequality to estimate N(v) in terms of the norms of the 
standard basis vectors. The triangle inequality also implies that 

(9.9) N(v) - N(w),N(w) - N(v) < N(v - w) 
for every v,w E R" or C", and hence that 

(9.10) \N(v) - N(w)\ < N(v — w) < A\\v — w\\ 2 . 

This shows that N is continuous with respect to the standard topology on R" 
or C™. 

The unit sphere in R" or C" with respect to the standard Euclidean norm 
consists of the vectors v such that ||u||2 = 1. By compactness, the continuous 
function N attains its minimum on the unit sphere, which is positive. Thus 

(9.11) N(v) > a 

for some a > and every v € R™ or C n with \\v\\2 = 1- It follows that 

(9.12) a\\v\\ 2 <N(v) 
for every v £ R" or C™, by homogeneity. 



10 Operator norms 

Let V, W be vector spaces, both real or both complex, with norms || • || • \\w, 
respectively. A linear mapping T : V — >• W is said to be bounded if there is an 
A > such that 

(10.1) ||T(t;)|| w <A||t;||v 

for every v £ V. If V is R™ or C™ and \\v\\v — \\v\\ p for some p, 1 < p < oo, then 
every linear mapping T : V — > W is bounded, as one can see by expressing any 
v G V as a linear combination of the standard basis vectors. This also works 
for any norm on R™ or C™, since any norm is equivalent to the p-norms, as in 
the previous section. The same statement holds as well for any norm on any 
finite-dimensional vector space V, because V is then isomorphic to R™ or C n 
for some n. 

The operator norm of a bounded linear mapping T : V — > W is defined by 

(10.2) ||T|| op = sup{||2»|| w : v € V, \\v\\ v < 1}. 

Equivalently, (10.1) holds with A = ||T|| op , and ||T|| op is the smallest nonnega- 
tive real number with this property. One can check that the operator norm is a 
norm on the vector space of bounded linear mappings from V into W. 
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Suppose that V\, V2, and V3 are vector spaces, all real or all complex, 
equipped with norms as before, and that T\ : V\ — > V2 and T 2 : V2 — > V3 
are bounded linear mappings. It is easy to see that the composition T 2 °T\, 
defined by 

(10.3) (T 2 o Ti)(«) - T 2 (Ti(u)), w e 14, 
is a bounded linear mapping from V\ into V3, and that 

(10.4) ||T2oTi||op,i3 < ||Ti||op,i2 ||T 2 ||op,23, 

where the subscripts indicate the spaces involved in the operator norms. For 
any normed vector space V, the identity operator / defined by I(v) = v for each 
v 6 V is a bounded linear mapping from V into itself, and satisfies 

(10.5) ||/||op = l, 

using the same norm on V as both the domain and range. 
Suppose that V is R™ or C" equipped with the norm 

n 

(10.6) Nli=El^l' 

and let ei, . . . , e n be the standard basis of V, so that the jth coordinate of ej 
is equal to 1 and the other coordinates are 0. In this case, 

(10.7) ||T|| op - max{||T( ei )|| w , . . . , ||T(e„)|| w } 

for any linear mapping T : V — > W. If V is any vector space with any norm 
|| • || v and the norm || • on W is associated to an inner product (-,-)w, then 

(10.8) ||T|| op = sup{|(7»,«V| :veV, \\v\\ v <l,w€W, \\w\\ w < !}• 
Indeed, for each z G W, 

(10.9) \\z\\ w = sup{\(z,w) w \ :weW, \\w\\ w < 1}, 

since the inner product is bounded by the norm because of the Cauchy-Schwarz 
inequality, and equality occurs with w = z/\\z\\w when z ^ and with any w 
when z = 0. 



11 Linear mappings 

Let T be a linear mapping from a vector space V into itself, and let T J be 
the composition of j factors of T for each positive integer j. Thus T 1 = T, 
T 2 = T o T, etc., and it is convenient to take T° to be the identity mapping 
I on V. As in the case of real and complex numbers, one can consider infinite 
series of the form 

00 

(11.1) 

3=0 
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and limits of sequences of averages of the form 

I + T H h T n 

(11.2) . 

n 

One can also consider these expressions applied to individual vectors in V. 

More precisely, if V has finite dimension, then limits of sequences of vectors 
in V can be defined in terms of the corresponding sequences of coefficients with 
respect to a basis of V, and limits of linear mappings can be defined in terms of 
the entries of the corresponding matrices. By standard arguments, convergence 
is independent of the particular choice of basis of V. Convergence can also be 
defined with respect to a norm on any vector space, as in the next section, and 
is equivalent to using a basis when V has finite dimension. 

Suppose for instance that v € V is an eigenvector of T with eigenvalue A G R 
or C, as appropriate, so that 

(11.3) T(v) = Xv. 

For each j, 

(11.4) T j (v) = X j v, 

and we are back to sequences and series of real and complex numbers. If V 
has finite dimension and there is a basis of V consisting of eigenvectors of T, 
then T J is diagonalized by the same basis for each j, and the previous sequences 
and series of linear mappings are reduced to sequences and series of complex 
numbers. 

Remember that any linear mapping T on a finite-dimensional complex vector 
space of positive dimension has a nonzero eigenvector, as a consequence of the 
fundamental theorem of algebra. If T is not diagonalizable, then the behavior 
of 7 1 - 7 can still be analyzed in terms of the Jordan canonical form. 



12 Convergence 

Let ybea real or complex vector space equipped with a norm || • ||. A sequence 
{vj}j of vectors in V is said to converge to v G V if for every e > there is an 
L such that 

(12.1) IK-«||<e 

for every j > L. This is the same as the usual definition of convergence of a 
sequence of real or complex numbers when V = R or C and the norm is given 
by the absolute value or modulus. As usual, the limit of a sequence is unique 
when it exists. 

If {vj}j, {wj}j are sequences of vectors in V that converge to v,w G V, 
respectively, then the sequence of sums Vj + Wj converges to v + w. Similarly, if 
{vj}j converges to v in V, and {tj}j is a sequence of real or complex numbers 
that converges to t € R or C, as appropriate, then {tj vj}j converges to t v in V. 
These statements can be verified using standard arguments. An infinite series 
YlJLo a i w ^ n t erms m y converges if the corresponding sequence of partial sums 
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b n = Y^j=o a j converges in V. If J^JLo a j converges, then {aj}j converges to 
in V, as in the case of real or complex numbers. 

If V — R™ or C™ with norm \\v\\ p for some p, 1 < p < oo, then a sequence 
{vj}j converges to v E V if and only if the corresponding n sequences of co- 
ordinates of the vj's converge to the coordinates of v as sequences of real or 
complex numbers. This also works for any norm on R™ or C™, since all norms 
on these spaces are equivalent. There are analogous statements for any finite- 
dimensional vector space V and any basis in V, using a linear mapping that 
sends the given basis to the standard basis in R n or C™, as appropriate. 

Suppose that V and W are vector spaces, both real or both complex, and 
equipped with norms. The space of bounded linear mappings from V into W is 
also a vector space, and convergence of sequences and series in this space can 
be defined in terms of the operator norm. 



13 Completeness 

Let V be a real or complex vector space with a norm || • ||. A sequence {vj}j 
of vectors in V is said to be a Cauchy sequence if for every e > there is an L 
such that 

(13.1) \\vj -vi\\ <e 

for every j, I > L. Convergent sequences are Cauchy sequences, and V is said 
to be complete if every Cauchy sequence in V converges to an element of V. 
A complete vector space with respect to a norm is known as a Banach space, 
and it is a Hilbert space if the norm is determined by an inner product. It is 
well known that R and C are complete with respect to the usual absolute value 
and modulus. A sequence in R™ or C™ is a Cauchy sequence with respect to a 
p-norm if and only if the n sequences of its coordinates are Cauchy sequences in 
R or C, as appropriate. It follows that R™ and C" are complete with respect 
to the p-norms, since R and C arc complete. Hence R™ and C™ are complete 
with respect to any norm, by equivalence of norms. This implies in turn that 
finite-dimensional vector spaces are always complete. 

Let Y^jLo a j be an infinite series with terms in V. This scries converges 
absolutely if 

oo 

(13.2) £>J 

converges as an infinite series of nonnegative real numbers. The sequence of 
partial sums of an absolutely convergent series is a Cauchy sequence, just as for 
absolutely convergent series of real or complex numbers. If V is a Banach space, 
then it follows that every absolutely convergent series of vectors in V converges 
in V. 

Conversely, if every absolutely convergence series in V converges, then V is 
complete. For suppose that {vj}j is a Cauchy sequence in V, and let {uj,}^ 
be a subsequence of {vj}j such that 

(13-3) \K-v il+1 \\<2~ 1 
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for each I. Thus Y^ibLo( v ji — v ji+i) converges absolutely in V, and hence converges 
in V by hypothesis. This implies that {vj t }fZo converges in V, since 

n 

(!3-4) - ) = - «j n+ i 

for each n. Therefore {vj}j converges in V, because a Cauchy sequence with a 
convergent subsequence converges to the same limit. 

Let V and W be vector spaces, both real or both complex, and equipped 
with norms. If W is complete, then the vector space of bounded linear mappings 
from V into W is complete with respect to the operator norm. For if {Tj}j is 
a Cauchy sequence of bounded linear mappings from V into W, then {Tj(v)}j 
is a Cauchy sequence in W for each v G V. Because W is complete, {Tj(v)}j 
converges in W, and its limit determines a linear mapping T from V into W. 
One can check that T is a bounded linear mapping from V into W, and that 
{Tj}j converges to T in the operator norm. 

14 The supremum norm 

A continuous real or complex- valued function / on a topological space X is said 
to be bounded if there is a nonnegative real number A such that 

(14.1) |/C«0I<^ 

for every x £ X. In this case, the supremum norm of / is defined by 

(14.2) ||/IU P = sup{|/(z)| :x€X}. 

This is a norm on the vector space of bounded real or complex- valued continuous 
functions on X. 

Convergence of a sequence of bounded continuous functions on X with re- 
spect to the supremum norm is the same as uniform convergence. If {fj(x)}j 
is a Cauchy sequence of bounded continuous functions on X with respect to 
the supremum norm, then {fj(x)}j is a Cauchy sequence of real or complex 
numbers for each x G X, as appropriate. Hence {fj}j converges pointwise to 
a function / on X, and one can use the Cauchy condition with respect to the 
supremum norm to show that {fj}j converges uniformly to /. This implies that 
/ is bounded and continuous on X, by well-known results about uniform con- 
vergence. Thus {fj}j converges to / in the supremum norm, and the space of 
bounded continuous functions on X is complete with respect to the supremum 
norm. 

Let V be a real or complex vector space equipped with a norm ||u||y- A 
continuous function f on X with values in V is said to be bounded if ||/(x)||y 
is bounded as a real- valued function on X, in which event the supremum norm 
of / with respect to \\v\\v is defined by 

(14.3) ||/|| aup>v = sup{||/(!c)||v:a;eX}. 
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This is a norm on the vector space Cj,(X, V) of bounded continuous functions 
on X with values in V. If V is complete with respect to ||u||y, then Cb(X, V) is 
complete with respect to ||/|| SU p,y, by an argument like the one in the previous 
paragraph. 

For example, if v G V and <j> is a bounded continuous real or complex-valued 
function on X, depending on whether V is a real or complex vector space, then 
$(#) = 4>{x) v is a bounded continuous function on X with values in V, and 

(14-4) II$IUp,v = UWsup \\v\\v 

If / is bounded continuous V- valued function on X, then ||/(a;)||y is a bounded 
continuous real- valued function on X whose supremum norm is ||/|| auP) y. Of 
course, every continuous function on X with values in V is bounded when X is 
compact. 



15 Algebras 

Let A be an associative algebra over the real or complex numbers. Thus A is a 
vector space over the real or complex numbers equipped with a binary operation 
of multiplication a b which is associative in the sense that 

(15.1) (ab)c = a(bc) 

for every a, 6, c G A. More precisely, multiplication is asked to be a bilinear 
mapping, which means that a M- a b is a linear mapping for each b G A, and 
b i->- a b is linear for each a € A. A pair of elements a, b of A commute with 
each other if 

(15.2) ab = ba, 

and A is a commutative algebra if this holds for every a, b G A. 

It will be convenient to suppose also that there be a nonzero multiplicative 
identity element e in A, which is to say that 

(15.3) ae = ec=a 

for every a G A. An element a of A is said to be invertible if there is a b G A 
such that 

(15.4) ab=ba = e. 

In this case, the inverse b of a is unique, and denoted a -1 . If a is invertible, and 
c G A commutes with a, then c commutes with a -1 as well, since 

(15.5) a" 1 c — a^ 1 caa^ 1 — a^ 1 aca^ 1 — ca^ 1 . 

If ai, a 2 are invertible elements of A, then their product a\ a 2 is invertible too, 
and is given by 

(15.6) (ai a 2 ) _1 = a^ 1 a^ 1 . 
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Conversely, if a\, ai are commuting elements of A whose product a\ 0,2 is in- 
vertible, then a\ and 02 are each invertible. In this case, a\ a2 and hence its 
inverse commute with a\ and 02, and 

(15.7) a^ 1 = (ai a 2 )~ 1 a 2 , fl 2 1 = fti (fli a 2 ) _1 . 

For example, the real or complex-valued continuous functions on a topologi- 
cal space X form a commutative algebra with respect to pointwise multiplication 
of functions. The constant function equal to 1 at every element of X is the mul- 
tiplicative identity element in this algebra, and a continuous function / on X 
is invertible in this algebra if and only if f(x) ^ for every x G X, in which 
event the inverse of / is given by l//(x). The bounded continuous function on 
X also form an algebra which is a subalgebra of the algebra of all continuous 
functions on X. In order for a bounded continuous function / to be invertible 
in this subalgebra, it is necessary that there be an 77 > such that 

(15.8) |/(*)|>»J 

for every x 6 X, so that 1/ f(x) is bounded on X. If X is compact, then these 
two algebras are the same. 

If V is a vector space, then the linear mappings on V form an algebra with 
composition as multiplication. The identity operator / on V is the multiplicative 
identity element of this algebra, and a linear mapping T on V is invertible in the 
algebra if and only if it is a one-to-one mapping of V onto itself. If V is equipped 
with a norm, then the bounded linear mappings on V with respect to this norm 
form an algebra which is a subalgebra of the algebra of all linear mappings on 
V. Invcrtibility of a bounded linear mapping T on V in this subalgebra means 
that the inverse mapping T _1 is also bounded. If V has finite dimension, then 
these two algebras are the same. 



16 Banach algebras 

Let A be an associative algebra with nonzero multiplicative identity element e 
over the real or complex numbers, and suppose that A is equipped with a norm 
INI- If 

(16.1) l|a&||<||a||||b|| 

for every a, b s A and 

(16.2) ' ||e|| = l, 

then (A, ||a||) is said to be a normed algebra. In particular, this implies that the 
product of a pair of convergent sequences in A converges to the product of the 
limits of the sequences, just as for products of convergent sequences of real or 
complex numbers. 

If A is complete with respect to ||a||, then A is said to be a Banach alge- 
bra. For example, the algebra of bounded continuous real or complex-valued 
functions on a topological space is a Banach algebra. 
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Let V be a vector space over the real or complex numbers equipped with a 
norm. The algebra of bounded linear mappings on V is a normcd algebra with 
respect to the operator norm. If V is complete, then the algebra of bounded 
linear operators on V is a Banach algebra. 

17 Invert ibility 

Let (A, \\a\\) be a normed algebra with nonzero multiplicative identity element 
e. For each a £ A and positive integer j, let a 3 be the product of j factors of a, 
so that a 1 = a, a 2 = a a, etc., with a = e. Thus 

(i7.i) ii^ii < hp 

for each j. If ||a|| < 1, then {a J '}|l converges to in A, 

CO CO 

and hence X)^lo ^ converges absolutely in .A. 

If || a || < 1 and A is a Banach algebra, then it follows that YlJLo a ^ converges 
in A. For each n > 0, 

n n 

(17.3) (e-a)(£V) = a J ) (e - a) = e - a n+ \ 
which implies that 

OO OO 

(17.4) (e-a)(j2 a ') = ( £ a ') ( e " a ) = e > 

3=0 3=0 

since a n+1 — > as n — > oo. Therefore, e — a is invertible, with 

CO 

(17.5) (e-aJ-^XV- 

This is a fundamental property of Banach algebras. 
Let a; be an invertible element of A, so that 

(17.6) l = \\xx- 1 \\<\\x\\\\x- 1 \\. 
If y € A satisfies 

( 17 - 7 ) \\ x ~y\\ <jj^i\> 

then 

(17.8) y = x - (x - y) = x (e - x^ 1 (x - y j) 
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is invertible by the remarks of the previous paragraph. In particular, the in- 
vertible elements of A form an open set. 
If ||a|| < 1, then 

OO OO I. I 

(17-9) ||(e - a)" 1 - e\\ = \\^\\ < £ ||a||' = 

Hence x n> x^ 1 is a continuous mapping on the set of invertible elements of A. 



18 Submultiplicative sequences 

A sequence {rj}°^ 1 of nonnegative real numbers is said to be submultiplicative 
if 

(18.1) r j+l <r jn 
for every j,l>l. In this case, 

(18.2) Km r]' n = inf 

n^oo n>l 

where the existence of the limit is part of the conclusion. To see this, observe 
that 

(18.3) r jn+l <(r n ) j (n) 1 
and hence 

(18.4) (r 3 n+i) 1/Un+l) < [(rvjV^WO'rc+O ( ri )'/0""+0 

for every j, l,n > 1. If n is fixed, then the right side can be approximated by 
(fn) 1 /™ for j sufficiently large and < I < n. 



19 Invert ibility, 2 

For each element x of a Banach algebra A, rj = ||a^'|| is a submultiplicative 
sequence of nonnegative real numbers. Put 

(19.1) p(x) = lim |ta; Il || 1/l1 = inf \\x n \\ 1 / n < \\x\\, 

n— >oo n>l 

which is known as the spectral radius of x, at least in the case of complex Banach 
algebras. Thus 

(19.2) p(tx) = \t\p(x) 

for each real or complex number t, as appropriate. 

If p{x) < 1, then X^=o I! 2 '"' II converges, and hence X^'=o x ° converges in A. 
As before, the sum is equal to the inverse of e — x under these conditions. Note 
that p{x) < 1 if and only if Hcc™!! 1 /™ < 1 for some n, which is the same as 
||x"|| < 1. Invertibility of e — x can also be obtained from the invertiblity of 
e — x n , since the latter is the product of the former and e + x + ■ ■ ■ + a;" -1 . 
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If p(x) < 1 and t is a real or complex number, such that \t\ < 1, as appro- 
priate, then consider 

oo 

(19.3) ^VV = (e-tx)' 1 . 

3=0 

If e — x is invertible, then (e — tx)~ x extends continuously to a neighborhood 
of t = 1. Conversely, if there is a sequence {tj}j such that \tj\ < 1 for each j, 
{tj}j converges to 1, and (e — tj x)~ l converges in A, then e — x is invertible 
and the limit of (e — tj x)^ 1 is equal to its inverse. 

Suppose that A is the algebra of bounded continuous real or complex- valued 
functions on a topological space X, with the supremum norm. In this case, 

(19.4) liriUHI/IILp 

for each n > 1 and /, and hence p(f) = ||/|| sup . 

20 Spectrum 

Let T be a linear mapping on a real or complex vector space V. To say that a 
real or complex number A, as appropriate, is an eigenvalue of T means exactly 
that T — XI has nontrivial kernel. In particular, T — XI is not invertible. 
Conversely, if V has finite dimension and T — X I is not invertible, then T — XI 
has nontrivial kernel, and A is an eigenvalue of T. 

Suppose that V is equipped with a norm || • and that T is a bounded 
linear operator on V. If v is a nonzero eigenvector of T with eigenvalue A, then 

(20.1) \X\\\v\\ v = \\T(v)\\v < \\T\\ op \\v\\ v 
implies that 

(20.2) |A| < ||T||op. 

Similarly, A" is an eigenvalue of T n for each positive integer n, and hence 

(20.3) |A|" < ||T"|| op . 
Thus 

(20.4) |A| < ||T"||i/" 

for each n, and therefore |A| < p{T). 

Let (A, || • ||) be a Banach algebra with nonzero multiplicative identity ele- 
ment e, let x be an element of A, and let A be a real or complex number, as 
appropriate. If |A| > ||a;||, then 

(20.5) Ae-z = A(e-A- 1 i) 

is invertible. The same conclusion holds when |A| > p(x). Equivalently, 

(20.6) |A| < p(x) 
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when A e — x is not invertible in A. 
The set 

(20.7) a{x) — {A : Ae — x is not invertible in A} 

is known as the spectrum of x in A, especially in the complex case. This is a 
closed set in R or C, as appropriate, since the complementary resolvent set of 
A such that A e — x is invertible is an open set when A is a Banach algebra. If 
A is a complex Banach algebra, then a famous theorem states that a(x) ^ for 
every x E A. Basically, if a(x) = 0, then 

(20.8) (Ae-x)- 1 

would be a nonconstant holomorphic .4-valued function on the complex plane 
that tends to as |A| — > oo, a contradiction. This is still a holomorphic A- 
valued function on the complement of <j(x) in the complex plane for any x E A. 
Another famous theorem states that 

(20.9) p(x) = max{|A| : A E a{x)} 

when A is a complex Banach algebra, as a consequence of the convergence of 

oo 

(20.10) x j 

3=0 

when a is a nonzero complex number such that |A| < l/\a\ for every A E <r(x). 



21 Averages in normed algebras 



Let (A, || • ||) be a normed algebra with nonzero multiplicative identity element 
e. If x E A and ||ir|| < 1, then 

( 21 -i) |E^||<Ei^ii<EiNi J <Ti 



3=0 



3=0 



3=0 



for each n. Hence 
(21.2) 



lim 

rwoc n + 



_W = o. 
+ 1 ^ 



3=0 



Suppose now that ||x|| = 1. Thus 



(21.3) 



3=0 

for each n. Of course, 
(21.4) 



1 n w n 1 n 



3=0 



3=0 



n 

— Y 

n + 1 ^ 
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for each n when x = e. 

For any iei and n > 1, 

n 

(21.5) {e-x)^x> =e-x n+1 . 

3=0 

This implies that 

n 

(21.6) £V = (e - x)- 1 {e - x n+1 ) 

3=0 

when e — x is invertible in A. If = 1 and e — x is invertible, then 

n 

(21.7) |£V || < || (e - x)- 1 ]] \\e x n+1 \\ < 2 ||(e - x)" 1 )!, 

so that (21.2) holds in this case too. 
If e — x is invertible, then 

n I n 

(21.8) £E* j = ^(e-x)- 1 ^-^ 1 ) 

= (n + 1) (e - x)- 1 - (e - x)- 2 (x - x n+2 ) 
for each n. If in addition ||a;|| = 1, then it follows that 

n I 

(21.9) lim V ' = (e-x)-\ 

1=0 j=0 

22 Invert ibility, 3 

Let A be a Banach algebra with nonzero multiplicative identity element e. Sup- 
pose that x £ A has the property that the sums 

n 

(22.1) £V 

3=0 

are uniformly bounded in A. In particular, X — C ^ X ^ X ^ • . . IS a bounded 
sequence in A, so that 

oo 

(22.2) r ° x3 

3=0 

converges absolutely when < r < 1. Using summation by parts, we get that 

(22.2) is equal to 

oo I 

(22.3) (l-r)^>< (]>>■>). 

Z=0 j=0 

Thus the boundedness of (22.1) implies the boundedness of (22.2). Hence the 
inverse of e — r x has bounded norm when < r < 1, since it is given by (22.2). 
It follows that e — x is invertible, by the results of Section 17. 
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23 The open mapping theorem 

Let (V, || • ||v), (W, || • \\w) be Banach spaces, both real or both complex, and 
let T be a bounded linear mapping from V into W. If T maps V onto W , then 
Banach's open mapping theorem says that T sends open subsets of V to open 
subsets of W. In particular, if T is a one-to-one mapping of V onto W, then it 
follows that T _1 is a bounded linear mapping from W onto V. 
For each r > 0, put 

(23.1) B v (r) = {v £ V : \\v\\ v < r}, 

and let Bw{r) be defined in the same way. Because T is linear, it suffices to 
show that there is an r > such that 

(23.2) T(B V (1)) D B w (r). 
The hypothesis that T map V onto W implies that 

oo 

(23.3) (J T(B v (n)) = W. 

71=1 



Hence W is the union of the closure T{By{n)) of T(By{n)), n > 1, and it 
follows from the Baire category theorem that T{Bv{n)) contains a nonempty 
open set for some n. Using linearity, one can show that there is an n > such 
that 

(23.4) T(B V {1)) D B w ( n ). 

Let us use completeness of V and linearity of T to show that 

(23.5) T(B V (2)) D B w (n). 

Let w £ Bw(n) be given. By (23.4), there is a vo £ By(l) such that 

(23.6) ||t«-T(«o)|| w < y. 

Applying the same argument to 2(w—T(vo)), we get that there is a v\ £ By(\jT) 
such that 

(23.7) \\ w -T(vz)-T{vr)\\w< r -\. 
Repeating the process, we get v ,vi,v 2 , ■ ■ ■ £ V such that 

(23.8) \\v j \\v<2- j 
for each j, and 

(23.9) \\w-J2 T (vj 



<2- l - 1 n 
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for each I. Thus 

oo 

(23.10) ^2\\v j \\ v <2, 

3=0 

and so YlJLo v j converges in V and satisfies 

oo 

(23.11) ||X>L <2 - 

3=0 

Moreover, 

oo 

(23.12) T^vj)=^ 

3=0 

which implies (23.5). 

24 The uniform boundedness principle 

Let V and W be vector spaces, both real or both complex, equipped with norms 
| • || v an d || • || w, respectively. Let Ti,T 2 , . . . be a sequence of bounded linear 
mappings from V into W such that {Tj(v)}°Z 1 is a bounded sequence in W for 
each v eV. If V is complete, then a theorem of Banach and Stcinhaus implies 
that the operator norms of the Tj's are bounded. 

As a variant of this, let /i, /2, • • • be a sequence of nonncgative real- valued 
continuous functions on a metric space M such that {fj{x)}J^ l is bounded for 
each x e M. If M is complete, then there is a nonempty open set in M on 
which the /j's are uniformly bounded. To see this, consider 

(24.1) E n = {x e M : f 3 {x) < n for each j}, 

which is a closed set in M for each n by continuity. The hypothesis of pointwise 
boundedness means exactly that 

oo 

(24.2) |J E n = M, 

n=\ 

and the Baire category theorem implies that E n contains a nonempty open set 
for some n, as desired. 

Let us apply this to M = V and fj(v) = \\Tj(v)\\w Note that ||Ij(u)|| w 
is a cintinuous function on V, since Tj : V — > W is a bounded linear mapping. 
Because of linearity, uniform boundedness of ||Tj(v)||w on a nonempty open set 
in V implies that the operator norms of the Tj's are bounded. 

In practice, we are especially interested in situations where {Tj(v)}°^ 1 con- 
verges in W for each v 6 V. 
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25 Strong operator convergence 



Let V and W be vector spaces, both real or both complex, with norms || • ||y 
and || • || iv- A sequence Ti, T2, . . . of bounded linear mappings from V into W is 
said to converge strongly to a linear mapping T from V into W if the following 
two conditions are satisfied. First, the operator norms of the Tj's are uniformly 
bounded, so that there is an A > such that 

(25.1) lollop < A 
for each j. Second, 

(25.2) lim Tj(v) =T(v) 

in W for each v G V. It follows that T is a bounded linear mapping from V 
into PF, with 

(25.3) ||T||„p < A. 

Of course, convergence in the operator norm implies strong convergence. If V 
is complete, then the pointwise convergence of a sequence of bounded linear 
mappings on V implies the boundedness of their operator norms, as in the 
previous section. Conversely, if the operator norms of the Tj's are uniformly 
bounded, and if T is a bounded linear mapping from V into W, then convergence 
of Tj(v) to T(v) for every v in a dense set in V implies the same property for 
every v G V . Similarly, if the operator norms of the Tj's are uniformly bounded, 
and if {T ) (w)}°^ 1 is a Cauchy sequence in W for each v in a dense set in V, then 
{Tj(v)}j^L 1 is a Cauchy sequence in W for every v G V. If W is complete, then 
it follows that {Tj}JL l converges strongly to a bounded linear mapping from V 
into W. 



26 Convergence of averages 

Let V be a vector space with a norm || • || , and let T be a bounded linear operator 
on V with ||T|| op < 1. Thus 



(26.1) 



^ ri 1 n 1 n 

j=0 j=0 j=0 

for each n. 

For v G V, let us consider the convergence in V of 



(26.2) 



1 " 
n+ 1 ^ w 



3=0 



If T(v) = v, then T^v) = v for each j, and (26.2) is equal to v for every n. 
If v = T(u) — u for some iieV, then 



(26.3) 



£t» = t" +1 h 
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for each n. Hence (26.2) tends to as n — > oo, since T n+1 (u) is bounded. The 
same conclusion holds when v is in the closure of the set of T{u) — u, u e V . 
Observe that 

(^•4) T(^±THv))-^±Ti {v) = -±- i{ T^ {v) - V ) 

3=0 j=0 

converges to as n — > oo for every v € V. If (26.2) converges for some !i£l / , 
then it follows that the limit is an eigenvector of T with eigenvalue 1. 



27 Hilbert spaces 

Let (V, (-, •)) be a Hilbert space, and let E be a nonempty closed convex set in 
V. Let v 6 V be given, and consider 

(27.1) r = m{{\\v-w\\ :weE}. 

Let wi,W2, ■ ■ ■ be a sequence of elements of E such that 



(27.2) 



lim \\v — Wj\\ = r. 



Because E is convex, (wj + wi)/2 € E for every j, I, and hence 



Wj + Wl 



> r. 



(27.3) 

The parallelogram law 

(27.4) ||a-6|| 2 + ||a + 6|| 2 = 2||a!| 2 + 2||6|| 2 

with a = v — wj, b = v + wj implies that 



(27.5) |K- - 

It follows that 
(27.6) 



Wl \\ 2 + 



2v- Wj - Wl \\ 2 = 2\\v- W:j \\ 2 + 2\\v- Wl \\ 2 . 



lim || — wi\\ = 0, 



which is to say that {wj}J^ 1 is a Cauchy sequence in V. This sequence con- 
verges, because V is complete, and its limit w is an element of E and satisfies 



(27.7) 



v — w\ 



r. 



Let us apply this to a closed linear subspace W of V. For each v £ V, the 
preceding argument implies that there is a w € W such that 



(27.8) 



\v — w\\ < \\v — w — z\ 
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for every z GW. By standard computations, this implies in turn that 

(27.9) (v-w,z)=0 

for every z G W, in the same way that the minimum of a function is attained 
at a critical point. Conversely, the latter condition implies that 

(27.10) \\v-w- zf = \\v - w\\ 2 + \\z\\ 2 > \\v - w\\ 2 

for every z G W. If u G W also satisfies (v — u, z) — for every z £ W, then 

(27.11) || — w\\ 2 = (u — w, u — w) = (u — v, u — w) + (v — w, u — w) — 0, 

since u — w G W , and hence u = w. 

Consider the closed linear subspace W L of V defined by 

(27.12) W 1 - = {y G V : (y, z) = for every z G W). 

The previous arguments show that every element of V can be expressed in a 
unique way as the sum of an element of W and an element of W- 1 . Let us check 
that 

(27.13) W = (W ± ) ± . 

Every element of W is contained in {W ) by definition, and so it suffices to 
check that x G (W ± ) ± is in W. If x = w + y for some w G W C (V^- 1 )- 1 and 
y G then it follows that y G (W^)^, so that (y, y) = 0, y = 0, and x = w. 
If is any linear subspace of V, then can be defined in the same way, 
and is a closed linear subspace of V. Of course, the closure W of W is a closed 
linear subspace of V, and one can check that 

(27.14) W 1 - = W^. 

Therefore W = (W^) 1 - = (W^. 

If W is a closed linear subspaces of V, then the orthogonal projection P = Pw 
of V onto W is the linear mapping on V defined by 

(27.15) P(v) = w 
for v = w + y with w G W, y G W 1 - . Thus 

(27.16) ||P(„)|| < || V || 
for every v G V, since 

(27.17) ll«l| 2 = II^H 2 + bl| 2 

in this case. In particular, ||P|| op = 1 except when W — {0} and P = 0. 
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28 Unitary transformations 

Let (V, (•, •)) be an inner product space. A one-to-one linear mapping T from 
V onto itself is said to be unitary if 

(28.1) (T(v),T(w)) = (v,w) 

for every v, w G V. This may also be described as an orthogonal transformation 
in the real case. If T is unitary, then 

(28.2) \\T(v)\\ = \\v\\ 

for every v G V, by taking v — w in the previous equation. Conversely, a 
linear mapping T of V onto itself which preserves norms is one-to-one and sat- 
isfies (28.1), by a polarization argument. Unitary transformations are obviously 
bounded, with operator norm 1. Note that the inverse of a unitary transforma- 
tion is also unitary. 

Suppose now that V is complete, which is to say that V is a Hilbert space. 
Consider the linear subspace 

(28.3) W = {T(u) -u : u G V}. 
Thus T(W) = W, since T(V) = V. Because T is unitary, 

(28.4) (y,T(u)-u) = (T- 1 (y)-y,u) 

for every u,y G V. It follows that W 1 - consists of the y G V such that T^ 1 (y) — 
y, which is equivalent to T{y) = y. 

As in the previous section, every v G V can be expressed in a unique way as 
w + y, where w G W, y G W 1 - , and hence T{y) = y. This implies that 

(28.5) ^ TT E^) = ^ TT E^H+^ 

3=0 j=0 

converges to y in V as n — > oo, as in Section 26. 

Let T\y be the restriction of T to VK, and let Iw be the identity mapping 
on W. If Iw — Tw has a bounded inverse on W, then 



n 

(28.6) ^BlW) 



n+ . . 

3=0 

converges to in the operator norm on W as n — > oo, as in Section 21. In this 
case, 

n 

(28.7) V T J 

n+ 1 ^ 

converges to the orthogonal projection of V onto W 1 - in the operator norm. 
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29 Rotations 

Put 

(29.1) T = {a £ C : \a\ = 1}. 

For each a £ T, let R a be the linear mapping on the vector space of continuous 
complex- valued functions on T defined by 

(29.2) R a (f)(z) = f(az) 

for each z £ T. This is an isometry of the space of continuous functions on T 
onto itself with respect to the supremum norm. 
If f(z) = z l for some integer I, then 

(29.3) R a {f)=a l f, 

so that / is an eigenvector of R a with eigenvalue a 1 . Note that f(z) can also 
be expressed as z~ l , since \z\ = 1. It follows that 

n 

( 2 9- 4 ) ^TTE^(/) 

3=0 

converges uniformly on T as n — > oo when / is one of these eigenfunctions, or 
a finite linear combination of these eigenfunctions. 

It is well known that the finite linear combinations of the functions z l , I £ 
Z, are dense in the space of continuous functions on T with respect to the 
supremum norm. This implies that (29.4) converges uniformly on T for every 
continuous function / on T. 

One can also consider the Lebesgue spaces L P (T) of measurable complex- 
valued functions / on T such that \f\ p is integrable with respect to Lebesgue 
measure on T, 1 < p < oo. For each a £ T, R a determines an isometric linear 
mapping of L P (T) onto itself, which is a unitary mapping when p = 2. The sum 

(29.4) also converges in the L p norm for every / £ L P (T) when 1 < p < oo, 
since it converges for a dense class of functions. 



30 Fourier series 

If / is a continuous complex- valued function on the unit circle T and a £ T, 
then 

(30.1) f f(az)\dz\= f f(z)\dz\. 

Jt Jt 

Here \dz\ denotes the element of arc length integration on T, which is invariant 
under rotations. This implies that 



(30.2) / z l \dz\ = 

for every nonzero integer I. 



T 
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For any pair of continuous functions /i, /2 on T, put 

(30.3) {fuf2)T = ^J h{z)W)\dz\. 

This defines an inner product on the vector space of complex- valued continuous 
functions on T, which extends to the Lebesgue space L 2 (T) of square- integrable 
functions on T, so that the latter becomes a Hilbert space. 
Observe that 

(30.4) {z°,z l ) T = whcnj/Z 

= 1 when j = I. 

Thus the functions z l , I E Z, are orthonormal with respect to this inner product. 
The Fourier coefficients of a function / on T arc defined by 

(30.5) f(j) = (/, zJ) T = i- jf f(z) z~ j \dz\, 
and the Fourier series associated to / is 

oo 

(30.6) 

J=-oo 

It is well known that the Abel sums of the Fourier series of a continuous function 
/ on T, given by 

oo 

(30.7) f(j)r ]j] z j , < r < 1, 

j=— oo 

converge to f(z) uniformly on T as r — > 1, because these sums can be expressed 
as averages of / concentrated near z on T. Similarly, the Cesaro means 

n l 
1=0 j=-l 

converge to f(z) uniformly on T as n — >• oo. 



31 Measure-preserving transformations 

Let X be a set with a cr-algebra of measurable subsets and a positive measure \x. 
Let 4> be a one-to-one mapping of X onto itself such that E Q X is measurable 
if and only if <j>(E) is measurable, and 

(31.1) n(4>(E)) = »(E) 

for every measurable set E C X. Conside the linear mapping 

(31.2) T(/) = /o0 
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acting on measurable functions / on X. If / is an integrable function on X, 
then T(f) is too, and 

(31.3) / T{f)dn= f fdfx. 

Jx Jx 

Similarly, T determines an isometry of L P (X) onto itself for each p, 1 < p < oo. 

In particular, T defines a unitary mapping on the Hilbcrt space L 2 (X). As 
in Section 28, 

n 

(31.4) ^TTE TJ '(/) 

j=o 

converges in L 2 {X) for each / G L 2 (X). For any p, 



. n -. n 

< 3L5 > ||^E^)|| p <^Ei™ii 



3=0 ' j=0 

for each / G L' P (X) and n > 0, since T is an isometry on L P (X). If / is in ^(X) 
and L°°(X), then / G L P (X) for 1 < p < oo, and the previous statements imply 
that the averages converge in L P (X) for 1 < p < oo. Because L X (X) (1 L°°(X) 
is dense in L P (X) for 1 < p < oo, it follows that the averages converge in L P (X) 
for every / G i p (X) when 1 < p < oo. 

If /u(^) < oo, then L 2 (X) is already a dense linear subspace of i 1 (X). 
In this case, convergence in L 2 (X) implies convergence in L X (X), and so the 
averages converge in L X (X) for every / G L 2 (X). Hence the averages converge 
in L X (X) for every / e L^X). 

Suppose that X is the set Z of integers, and that \i is counting measure on 
Z. If <p(l) = I + 1 for each n and / is a function equal to 1 at one point and to 
elsewhere, then the averages have L 1 norm equal to 1 for each n, but converge 
to uniformly on X. 



32 Sequence spaces 

Let A be a finite set with at least two elements, and let X be the set of doubly- 
infinite sequences x — {xj}j^._ (x with Xj G A for each j. Thus X is a compact 
Hausdorff space with respect to the product topology defined using the discrete 
topology on A. Let <j> : X — > X be the shift mapping such that the jth term of 
4>{x) is equal to Xj+\. This is a homeomorphism from X onto itself. Suppose 
that to each a G A is assigned a nonnegative real number w(a) such that 

(32.1) £>(a) = l. 

aeA 

This defines a probability measure on A, which leads to a probability measure 
on X. This probability measure on X is preserved by the shift mapping (f>. 
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33 Maximal functions 

Let / be a locally-integrable function on the real line. Consider the Hardy- 
Littlewood maximal function /* associated to /, defined by 

(33.1) f*(x) = sup ±- [ \f(y)\dy, 

x£l | J | J I 

where the supremum is taken over all open intervals / = (a, b) in R that contain 
x, and where |/| denotes the length b — aoil. Thus f*{x) > by construction, 
and f*(x) = +oo is certainly possible. The mapping / i-> /* is sublinear in the 
sense that 

(33.2) (/i+/ 2 )*</r+/ 2 * 
for any locally-integrable functions / on R, and 

(33.3) (*/)* = |*l /* 

for any locally-integrable function / on R and constant t. If / is bounded, then 
/* is bounded, and 

(33.4) H/loo < H/Hoo. 

If / is any locally-integrable function on R, then /* is lower semicontinuous. 
This means that 

(33.5) E x = {x e R: f*{x) > A} 

is an open set for each A > 0. For if f*(x) > A for some ieR and A > 0, then 

(33.6) ±.Jtf(y)\ dy> x 

for some open interval / in R that contains x. This implies in turn that /* > A 
on /, amd hence that I C E\, as desired. 
If / is an integrable function on R, then 

(33.7) \E x \<lJ\f(y)\dy 

for every A > 0, where \E\\ is the Lebesgue measure of E\. In particular, 
/* < oo almost everywhere on R. To prove this estimate, it suffices to show 
that 

(33.8) \ R \~\I 



R 



for every compact set K C E\. 

By compactness, there are finitely many open intervals h, . . . ,I n such that 

(33.9) ±.J^ f{y) \ dy>X 
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for each and 

n 

(33.10) KclJb. 

i=i 

We may also suppose that each x G R is contained in at most two of the ij's. 
For if a point is contained in three of these intervals, then one of the intervals is 
contained in the union of the other two, and may be dropped from the collection. 
This implies that 

(33.11) \K\<J2\Ii\ < El / l/(»)l d » 

\! \f(y)\dy 



< 



< 



lj n \f(y)\dy, 



as desired. 



34 L p estimates 

Let be a nonnegative measurable function on the real line, and let p be a real 
number with p > 1. Observe that 

r r rH x ) 

(34.1) / <j>(x) p dx = / / pX^dXdx 

Jr. JrJo 

> 

pX 13 - 1 |{ieR: <j>(x) > X}\ dX. 



I 

Jo 



We would like to apply this to <p — /*, where / G L P (R) and p > 1. 
For each A > 0, let f\ be the function defined by 

(34.2) f x (x) = f(x) when 1/(^)1 < ^ 

= otherwise. 

Thus 

(34.3) /*(*) < K(x) + (/ - fx)*(x) < \ + (/ - fx)(x)* 

for every x G R, by subadditivity of /* and (33.4), which implies that 

(34.4) {x G R : f*(x) > X} C ja; G R : (/ - / A )*(a;) > 
Combining this with (33.7) applied to / — fx, we get that 

(34.5) \{x G R : /* (*) > A}| < 1 / |/(y)|d|/ 
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for every A > 0. Hence 



(34.6) / t{x) p dx< / 4 p \p- a / \f(y)\dyd\. 

JR JO J{yeR:\f(y)\>\/2} 

Interchanging the order of integration, we get that 

(34.7) I f*{xfdx<[ [ 2lKV)l 4p\P- 2 \f(y)\d\dy. 



JR JRJO 

If p > 1, then it follows that 

(34.8) f f*{xydx < f \f(y)\ p dy. 

Jr v - 1 Jr 

If / is not equal to almost everywhere on R, then one can check that f*(x) 
is at least a positive constant multiple of l/\x\ when \x\ is sufficiently large. This 
implies that /* is not integrable on R, although it can still be locally integrable. 



35 Maximal functions, 2 

Let (M,d(x,y)) be a metric space. For each x £ M and r > 0, the open ball 
with center x and radius r is defined by 

(35.1) B(x, r) = {y G M : d(x, y) < r}. 

If B = B(x, r) is an open ball and a > 0, then a B denotes the ball B(x, a r). 

Suppose that Bi, B 2 , ... is a finite sequence of open balls in M, or an infinite 
sequence of balls whose radii converge to 0. Let j3\ be one of these balls with 
maximum radius, and let 02 be another one of these balls which is disjoint from 
(3i and has maximal radius, if there is one. Continuing in this manner, we 
get a finite or infinite sequence of pairwise disjoint balls /3i,/?2, • • • chosen from 
Bi, B2, ■ ■ ■ such that for each Bi there is a 0j which intersects Bi and has radius 
at least that of Bi. Hence Bi C 3 f3j, and so 



(35.2) U B ^IM 



A positive Borel measure \x on M is said to be doubling if the measure of 
each open ball in M is positive and finite, and if there is a C > such that 

(35.3) fj,(2 B) < C fi(B) 

for every open ball B in M. If / is a locally integrable function on M with 
respect to fi, then the corresponding maximal function /* is defined by 

(35.4) /*(*) = sup -5- f \f\dn, 
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where the supremum is taken over all open balls B in M with x £ B. As in 
[7, 8], one can show that /* satisfies properties like those in the previous sections, 
using the covering lemma described in the preceding paragraph in particular. 

For example, this applies to Lebesgue measure on R™ equipped with the 
standard metric. As in [32], one can also consider R™ +1 equipped with a metric 
associated to parabolic dilations. In addition to nonstandard dilations, there 
can be more complicated geometric structure related to nilpotent Lie groups. 
Many common fractals are equipped with natural doubling measures. 



36 Discrete maximal functions 

If / is a function on the set Z of integers, then the corresponding maximal 
function is defined by 

(36.1) /*(/)= sup _L^£|/(j)|, 

n<i<h o — a + 1 



j=a 



where more precisely the supremum is taken over all integers a, b with a < I <b, 
and may be infinite. As before, / i-> /* is sublinear, /* is bounded when / is 
bounded, and satisfies 

(36.2) sup /*(/)< sup |/(Z)|. 

lez lez 

If wc identify functions on Z with functions on R that are constant on intervals 
of the form [I, I + 1) for I S Z and have the same values at the integers, then 
this maximal function is obviously less than or equal to the previous one on R. 
Using this observation, or arguments analogous to the earlier ones, it is easy to 
see that 

_ oo 

(36.3) | {jeZ:/ * (i ) >A }|<- £ |/(j)| 

j=—oo 

for every A > 0, where \E\ is the number of elements of E C Z, and 

(36.4) J2 rur < E i^')i p 

j — — oo j=— oo 

when 1 < p < oo. 



37 Another variant 

Let A be a set with a positive measure /z, and consider X x Z with the measure 
which is the product of /j, and counting measure. If / is a measurable function 
on X x Z, which is to say that f(x, I) is measurable in x for each I, then consider 
the maximal function /* defined by 

1 b 

(37.1) f*(x,l) = sup J2\f( x ,j)\, 

a<i<b o — a + 1 
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which is the maximal function of f(x,l) in I for each x <E X. Thus / i— )• /* is 
sublincar, /* is bounded when / is bounded, and 

(37.2) H/IU < H/lloo- 
Moreover, 

2 00 f 

(37.3) J] e X : > A}) < - £ / 

j— — oo j — — oo 

for each A > 0, and 

(37.4) f; f t( X ,jy d »{x)<^^- jr [ \f(x,jrd^ X ) 

when 1 < p < 00. 

38 Transference 

Let X be a set with a positive measure /z, let be a one-to-one measure- 
preserving mapping of X onto itself, and let 

(38.1) T(f) = fod> 

be the corresponding linear transformation on measurable functions on X. Con- 
sider the maximal function 

(38.2) A*(f)(x) = sup — i— J2 \T j (f)(x)\, 

k,l>o k + l + l ^ 

where the supremum is taken over nonnegative integers k, I. It is also convenient 
to consider the approximations to this defined by 

08.3) KM*)=™ <nir ^i:\Tm*)\- 

Thus A^(f)(x) is monotone increasing in n, and tends to A*(f)(x) as n — > oo. 
As usual, A* and A* n are sublinear operators that send bounded functions to 
bounded functions, with 

(38.4) ||^L(/)||oo < ll^*(/)Hoo < H/lloo- 

By construction, they also commute with T, which is to say that 

(38.5) A* n (T(f)) = T(A* n (f)), A*(T(f)) = T(A*(f)). 
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In particular, 

(38.6) p({x e X : A* n (T l (f))(x) > A}) = ^({x G X : A* n (f)(x) > A}) 
for each n, I, and A, and 

(38.7) f A* n (T l (f)fd^= f A* n (f) p d» 
Jx Jx 

for each n, and p. 

Let N be a large positive integer. If / is a measurable function on X, then 
let F(x,j) be the function on X x Z defined by 

(38.8) F(;r,.7)=T'' (/)(*) 

when < j < N, and F(x,j) — otherwise. Because is measure-preserving, 

(38.9) Yl / \F{^3)\ P dKx) = {N + l) \f\? dp 

j=-oo'' X 

for each p. Let F*(x,l) be the maximal function in the second variable, as in 
the previous section. If n < I < N — n, then 

(38.10) A* n (T l (f))(x)<F*(x,l). 
It follows that 

(38.11) (N — In) »({x e X : A* n {f){x) > A}) 

oo 

< ]T n({x e X : F*(x,j) > A}) 

j=-oc 

for each A, and hence 

(38.12) {N — 2n) p({x e X : A* n (f)(x) > A}) 

o 00 r 

j = -oc X 

= {N + i)\j x \f\dn. 

Equivalently, 

(38.13) e X : A* n (f)(x) > A}) < \ ^ l/l ^ 
for every N > 2 n, and so 

(38.14) M ({x e X : <(/)(*) >\})<\J x |/| d M , 
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which implies that 

(38.15) fi({x e X : A*(f)(x) > A}) < | J \f\dfM 



for every A. 

Similarly, for each p, 



(38.16) (N — 2n) I A* n {ffd»< ]T / F*(x,j) p d[i(x). 

J* j=-oo Jx 

If 1 < p < oo, then 

(38.17) (N-2n) [ A* n (f) p dfi 

Jx 



< Ap2P 



^— J2 / \F{x,j)\>drtx) 
P — 1 JX 



Thus 



( 

when N > 2 n, which implies that 



(38.18) / K(fY>d»< / |/|*d/x 



(38.19) / A * n (frd»<^-^- [ \f\Pdf,, 
Jx P - 1 Jx 

and hence 

(38.20) / A * {f) P dll <^l f d/z. 
Jx P - 1 Jx 

39 Almost-everywhere convergence 

A standard application of maximal function estimates is to the existence of 
pointwise limits almost everywhere. For example, if / is a locally integrable 
function on the real line, then 



(39.1) 



j\j\f(v)-m\dy^V 



for almost every x G R, where the limit is taken for open intervals / with x £ I 
as |/| — > 0. One may as well suppose that / is integrable here, or even has 
compact support, since the problem is local. The limit is trivial when / is 
continuous, and any integrable function can be approximated in the L 1 norm 
by a continuous function. Using maximal function estimates, one can show that 
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the approximations by continuous functions also work for the pointwise limits 
almost everywhere. 

One can also show that maximal functions associated to the Abel and Cesaro 
sums of the Fourier series of an integrable function are bounded by the analogue 
of the Hardy-Littlewood maximal function on the circle. Hence these maximal 
functions satisfy the same sort of estimates as before. This can be used to show 
that the Abel and Cesaro sums converge to the function almost everywhere, 
which could be derived from the limit described in the previous paragraph as 
well, and which is basically the same type of statement anyway. 

Suppose now that A is a set with a positive measure /j,, (f> is a measure- 
preserving transformation on A , and that T(f) = f o<f> for measurable functions 
/ on X. If / e LP(X), 1 < p < oo, then 

oo 

= Yl TT~ / \f\ p dfi< oo. 



Hence 

(39.3) V«<. 

n— 

for almost every x G X, and thus 

(39.4) iim mm = o 

n^oo n + 1 
for almost every ir G X . Of course, 

(39.5) ll^"(/)l|oo = \\fU _> Q 
v y n+ 1 n+1 

as n — > oo when / is bounded A. 

For each x G A, consider the sequence of averages 

n 

(39.6) ^{f)(x). 
If T(/) = /, then 

(39.7) Y,T\f){x)=f{x) 

3=0 

for each n, and converge as n — > oo trivially. If / = T(b) — b for some b G L P (A), 
1 < p < oo, then 

(39.8) J-±THf ){x) = ^ 1 W x \- b M 
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converges to almost everywhere on X as n — > oo, by the remarks in the 
preceding paragraph. As in Section 28, the functions of the form 

(39.9) a + (T(b) - b), a,b 6 L 2 {X), T{a) = a, 

are dense in L 2 (X), since T defines a unitary transformation on L 2 (X). Thus the 
averages (39.6) converge almost everywhere on X for a dense class of functions 
in L 2 (X). Using the boundedness of the maximal function on L 2 (X), one can 
show that the averages converge almost everywhere for any / £ L 2 (X). This 
also holds for / £ L P (X) when 1 < p < oo, because L P (X) n L 2 (X) is dense in 
L P (X), and using the maximal function estimates on L P (X). 

40 Multiplication operators 

Let (X, n) be a measure space, and let b be a bounded measurable function on 
X. The corresponding multiplication operator is defined by 

(40.1) T(f) = bf. 

This determines a bounded linear operator on L P (X) for 1 < p < oo, with 
operator norm equal to the essential supremum norm of b for each p. 

In particular, if {bj}j is a sequence of bounded measurable functions on X 
that converges to b in the L°° norm, then the corresponding sequence {Tj}j of 
multiplication operators converges to T in the operator norm on L P {X) for each 
Pi 1 < P < °°- If instead {bj}j is a sequence of bounded measurable functions 
with uniformly bounded L°° norms that converges to b pointwise almost every- 
where on X, then the dominated convergence theorem implies that {Tj(f)}j 
converges to T(f) in the LP norm for every / £ L P (X) when 1 < p < oo. 

For example, if ||&||oo < 1, then b J converges to in the L°° norm as j — > oo. 
If H&Hoo = 1 and |£>(a;)| < 1 for almost every x E X, then H^ Hoo = 1 for each j 
and linij^oo b(x) J = almost everywhere on X. 

Suppose that ||6||oo < 1 ; and consider 

(40.2) a „ = _l_^^. 

3=0 

Thus 

1 " 

Clearly ||a n ||oo ^ 1 f° r each n. As in Section 2, {a n } n converges pointwise 
to the function that is equal to 1 when b is equal to 1 and to elsewhere. 
Hence the sequence of averages of the Tj 's converges strongly to the operator of 
multiplication by the characteristic function of the set where b = 1 on L P (X) 
when 1 < p < oo. 
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41 Doubling spaces 



A metric space (M, d{x, y)) is said to be doubling if there is a C > such that 
every ball in M of radius r is contained in the union of < C balls of radius r/2. 
Thus every ball of radius r can also be covered by < C 2 balls of radius r/4, etc. 
Equivalently, M is doubling if there is a C > such that every set in M of 
diameter i can be covered by < C subsets of diameter < t/2. 

For example, it is easy to see that finite-dimensional Euclidean spaces are 
doubling. In this case, it suffices to consider coverings of the unit ball, because 
a covering of any other ball can be reduced to the unit ball using translations 
and dilations. A similar argument works on nilpotcnt Lie groups under suitable 
conditions. If (M, d(x, y)) is a metric space which is doubling and E C M, 
then E is doubling with respect to the restriction of d{x, y) to E. In particular, 
subsets of finite-dimensional Euclidean spaces are doubling with respect to the 
induced metric. 

Let M be a metric space with a doubling measure /i, and let B be a ball 
of radius r in M. Suppose that A C B has the property that d(x,y) > r/2 
for every x,y G A with x ^ y. Thus the balls B(x,r/2), x G A, are pairwise 
disjoint, and B(x,r/2) C (3/2)5 for every x G B, and hence 

(41.1) ^M^,r/2))<M(3/2)S). 

This implies an upper bound for the number of elements of A in terms of the 
doubling constant for n, since 

(41.2) BCB{x,2r) 

for each x G B, so that /u(-B) is less than or equal to a constant multiple of 
fj,(B(x, r/2)). If A is a maximal set of this type, then for each z G B there is an 
x G A such that d(x, z) < r/2, which means that 

(41.3) B C \J B(x,r/2), 

xeA 

from which it follows that M is doubling. 

Remember that a set E is a metric space M is said to be totally bounded 
if for each e > 0, E can be covered by finitely many balls of radius e. Totally 
bounded subsets of any metric space are automatically bounded, and bounded 
subsets of a doubling metric space are totally bounded. If M is complete, then 
K C M is compact if and only if K is closed and totally bounded. If M is also 
doubling, then it follows that K C M is compact if and only if K is closed and 
bounded. 

42 Ultrametrics 

A metric d(x, y) on a set M is said to be an ultrametric if 
(42.1) d(x, z) < m&x(d(x, y), d(y, z)) 
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for every x,y,z <G M, which is stronger than the usual triangle inequality. Some 
examples of ultrametric spaces are given in the next section. 

Let (M,d(x,y)) be an ultrametric space. If x, y E M and r,t > 0, then 
either 

(42.2) B(x,r) C B(y,t), 
or 

(42.3) B(y,t)CB(x,r), 
or 

(42.4) B(x,r)nB(y,t) = 9. 

More precisely, the first alternative holds when d(x, y) and r are less than or 
equal to t, the second alternative holds when d(x, y) and t are less than or equal 
to r, and the third alternative holds when r and t are strictly less than d(x, y). 

Suppose that B is a collection of open balls in M. The remarks of the 
previous paragraph imply that the maximal elements of B are pairwise disjoint. 
Under suitable conditions, every element of B is contained in a maximal element. 

This is a very simple covering argument for ultrametric spaces. Doubling 
conditions are still important, even if they are not required for a covering lemma. 



43 Sequence spaces, 2 

Let A be a set with at least two elements, and let M be the set of sequences 

x = {xj}J!L 1 with Xj G A for each j. Fix a positive real number p < 1, and put 

(43.1) d(x,y) = p n 

when x,y G M, x ^ y, and n is the largest nonnegative integer such that 

x i = % f° r j — n ' an d otherwise put d(x, y) — 0. This defines an ultrametric 
on M for which the corresponding topology is the product topology, when M 
is considered as the product of infinitely many copies of A equipped with the 
discrete topology. Let us restrict our attention to the case where A has only 
finitely many elements, so that M is compact. It is easy to see that M is 
doubling in this case. Let w(a) be a positive real number for each a E A whose 
sum is equal to 1. This defines a probability measure on A which leads to a 
probability measure p on M, and one can check that p is a doubling measure 
on M. 



44 Snowflake transforms 

If a, x, y are nonnegative real numbers and < a < 1, then 

(44.1) (x + y) a <x a +y a . 
This is because 

(44.2) x + y<(x a + y a )umx{x,y) 1 - a 
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and 

(44.3) max(x,y) < (x a + y a ) 1 / a 

imply that x + y < (x a +y a ) 1/a . 

If (M,d(x,y)) is a metric space and < a < 1, then it follows that d(x,y) a 
is also a metric on M. If d(x,y) is an ultrametric on M, then d(x,y) a is an 
ultrametric on M for every a > 0. Note that d(x,y) a determines the same 
topology on M as d(x, y) in both situations. 

For each p£M and r > 0, the open ball in M centered at p with radius r 
with respect to d(a;, y) is the same as the open ball centered at p with radius r a 
with respect to d(x, y) a . Using this fact, it is easy to see that M is doubling with 
respect to d(x, y) if and only if it is doubling with respect to d(x, y) a . Similarly, 
a positive Borel measure /i on A is doubling with respect to d{x, y) if and only 
if it is doubling with respect to d(x, y) a . Of course, the doubling constants for 
d(x,y) a are not normally the same as for d(x,y). 

The diameter of a set E C M with respect to d(x,y) a is equal to the ath 
power of the diameter of E with respect to d(x,y). It is easy to see that the 
i-dimensional Hausdorff measure of E C M with respect to d(x, y) is equal to 
the (i/a)-dimensional Hausdorff measure of E with respect to d(x, y) a , using the 
preceding observation and the definition of Hausdorff measures. In particular, 
the Hausdorff dimension of E with respect to d(x,y) a is equal to the Hausdorff 
dimension of E with respect to d(x, y) divided by a. 



45 Invariant measures 

Let A be a compact Hausdorff topological space, and let C(X) be the space of 
continuous real-valued functions on X equipped with the supremum norm. A 
linear functional A on C{X) is said to be bounded if it is bounded as a linear 
mapping into R as a one-dimensional vector space with the absolute value as 
norm. Thus A is bounded if there is an A > such that 

(45.1) |A(/)|<A||/|U 

for every / G C(X), in which case the dual norm ||A||* of A is the smallest value 
of A for which this condition holds. This defines a norm on the vector space 
C(X )* of bounded linear functionals on C(X), which is the same as the operator 
norm of A as a linear mapping from C(X) into R. 

A linear functional A on C(X) is said to be nonnegative if 

(45.2) A(/) > 

for every / G C(X) which is nonnegative in the sense that f(x) > for every 
x G X. If A is a nonnegative linear functional on C(X), then A is bounded, and 

(45.3) l|A||* = A(l), 

where the right side means A applied to the constant function equal 1 on A. 
Every nonnegative linear functional on C(X) can be expressed by integration 
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with respect to a positive Borel measure on X, by the Riesz representation 
theorem. 

A set U C C(X)* is an open set in the weak* topology if for every A 6 U 
there are finitely many continuous functions fi, - ■ ■ ,f n on A and positive real 
numbers r\ , . . . r n such that 

(45.4) {A' e C(Xy : |A'(A) - A(/i)| < n, . . . , |A'(/„) - A(/„)| < r n } C (7. 
The Banach-Alaoglu theorem implies that 

(45.5) B* = {\ e C(X)* : ||A||* < 1} 

is compact with respect to this topology. If the topology on X is determined 
by a metric, then C(X) is separable, and one can show that the topology on B* 
induced by the weak* topology is determined by a metric. 
Let <j) be a homeomorphism of X onto itself, and let 

(45.6) T(f) =fo4> 

be the corresponding linear operator on C(X). This leads to a dual linear 
operator T* on C(X)*, defined by 

(45.7) T*(A)(/) = A(T(/)). 

Note that T(f) > when / > 0, and hence T*(A) is nonnegative when A is 
nonnegative. Similarly, T preserves the supremum norm on C(X), and so T* 
preserves the dual norm on C(X)*. Of course, T automatically sends constant 
functions to themselves on X. 
Consider 

n 

(45.8) A„= J2(T*Y(\) 

3=0 

for each nonnegative integer n and A G C(X)*. Observe that 

(45.9) T*(A„) - A n = -1 T ((T*)"+ 1 (A) - A), 
which implies that 

(45.10) ||T*(A„)-A„||» < 2 " A " 



n+l 
In particular, 

(45.11) lim ||T*(A„)-A„||*=0 

n— >oo 

Let ^/ be the closure of A/, A/+i, ... in the weak* topology. By compactness, 

oo 

(45.12) E=f)Ei^(fr. 

1=0 
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If the topology on X is determined by a metric, then there is a subsequence of 
Ao, Ai, . . . that converges in the weak* topology, and whose limit is in E. Every 
element of E is invariant under T* , by the remarks of the previous paragraph. 

If A is nonnegative, then A„ is nonnegative for each n, and every element of 
E is nonnegative. In this case, A n (l) = A(l) for each n, and every element of E 
has the same value when applied to the constant function 1. It follows that A„ 
and every element of E has the same norm as A when A > 0. Using this, one 
can check that there is a nonnegative linear functional on C{X) with norm 1 
which is invariant under T* , and which corresponds to an invariant probability 
measure on X. 
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